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Abstract 

This paper presents a framework for link management 
within hyper-media documents. The framework includes: 
(]) an object-oriented document architecture with a con- 
sistent interface for different media types; (2) a transi- 
tion model between multimedia objects founded on content- 
based static links for atemporal media, and content/time- 
based dynamic links for temporal media; and (3) intrinsic 
support for content-based access which requires a reduced 
amount of indexing. A prototype system has been imple- 
mented to verify the feasibility of the paradigm. The results 
confirm that the framework is viable and back up the for- 
malism 



1. Introduction 

Recent advances in multimedia technology have made 
multimedia documents commonplace. With the availability 
of many different media types, applications require new ac- 
cess methods and extended functionality, which are not pos- 
sible with the text-only counterparts. Existing systems (e.g., 
the World-Wide Web) already integrate heterogeneous mul- 
timedia objects within one document. However, the avail- 
able exploration methods, originally developed for text doc- 
uments, do not take into account the additional dimensions 
associated with multimedia information. 

The aim of this paper is to provide a framework for ad- 
vanced link management within a refined multimedia doc- 
ument architecture. The primary contributions are the fol- 
lowing: 

1. Based on multimedia objects, we introduce an 
object-oriented architecture for hyper-media docu- 
ments which allows for non-linear transitions between 
multimedia objects in a consistent manner. 

2. We formalize the relationships (links) between mul- 
timedia objects. Static links parameterize transitions 



only by contents and work on any type of objects. They 
are already widely used in systems like the World- 
Wide Web. For objects with a temporal dimension 
(e.g., audio, video) we introduce dynamic links, which 
parameterize transitions by contents and time. To rep- 
resent a hyper-media document with dynamic links, 
we extend the tree-like hierarchical structure to accom- 
modate the additional temporal dimension and intro- 
duce planar nodes and planar document structures. 

3. We describe the intrinsic support our framework pro- 
vides for content-based access and identify different 
query types. 

Section 2 describes the problem and presents a survey of 
other systems which handle non-linear transitions between 
different media types, content-based access and hyper-links. 
Section 3 introduces the terminology, notation and seman- 
tics within our framework. The set of operations on multi- 
media objects is described in Section 4. We summarize in 
Section 6. 

2. Problem description 

To make our objectives easier to understand, we begin 
with a short example: 

Assume you are watching a video which 
shows the northern part of the campus at the Uni- 
versity of Illinois at Urbana-Champaign. The 
camera pans slowly over the Beckman Institute 
and while the building is visible, a message win- 
dow signals that a video about the Institute is 
available and a transition to this video object is 
possible. You click on the image at this time and 
the playback of the new video starts. The camera 
enters the building and you can see the Beckman 
instruments, which are on display at the South 
entrance. While the images display these instru- 
ments, the message window shows that a text doc- 
ument about them is available. While this tran- 
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sition is possible, you click on the output win- 
dow again. The video presentation pauses and a 
window displaying this text pops up. When you 
are done reading, you close the text window and 
the Beckman video resumes playback at the point 
where it was interrupted. You see images from the 
main lobby, and the message window shows that 
a sound file is available. You decide you have 
seen enough of the Beckman Institute for now 
and select the return button. The video with the 
North campus resumes from where it was inter- 
rupted. The camera is now over the Computer 
and Systems Research Laboratory. The message 
window signals that an audio-video presentation 
of this laboratory is available. Next you see the 
Civil Engineering Building, and then the Digital 
Computer Laboratory. You decide that you would 
like to see all the videos that are available about 
DCL. You send a query to search for these objects 
and start watching the ones that have been found. 

This paper describes a framework that allows for pre- 
sentations like the previous one. We abstract a multime- 
dia document as a hierarchy of multimedia objects, which 
could be text, graphics, images, video, audio, etc. We focus 
on two aspects: links 1 between related multimedia objects; 
and content-based access. 

We use an object-oriented architecture which offers a 
high-level abstraction and is easy to integrate within exist- 
ing multimedia document standards, like MHEG [9, 13]. 
The superclass of all document objects, multimedia object, 
is an abstract class. This is illustrated in Figure 1, where we 
use the notation from [5] and the abstract classes appear in 
italics. The two subclasses correspond to different require- 
ments. The temporal subclass includes all objects which 
are intra-domain time dependent (e.g., audio, video, etc.). 
The atemporal subclass includes objects which do not have 
intra-domain time dependencies (e.g., text, static images, 
graphics, etc.). 

Based on the architecture from Figure 1, we start by de- 
scribing other systems that deal with hierarchical visual in- 
formation, non-linear content-based transitions and hyper- 
links. 

For each system, we focus on two characteristics: the 
media types of entities the system operates with, and the 
transitions between related entities. In each case, the be- 
havior is illustrated in a transition diagram which represents 
different entities as filled circles and the transitions between 
them as arrows. 

• The Slovenian Virtual Gallery [12] offers a virtual 




Images 



'The abstractions wc use are independent of the media. A link relates 
several objects which could have different media types. For this reason, 
there is no need to use the term "hyper-link." 



Figure 1. Multimedia document architecture. 
We denote the set of all multimedia objects 
with O, and the set of temporal multimedia 
objects with %. 



walk through several galleries, organized as a hierar- 
chy of images. The user controls the presentation by 
clicking on different parts of an image. This deter- 
mines the next image. The system handles just atem- 
poral information, and the only possible type of transi- 
tion is given in Figure 2. 

Transition 



Atemporal (image) Atemporal (image) 

Figure 2. Transition diagram for the Slovenian 
Virtual Gallery. 



• The Advanced Video Information System (AVIS) [1] 
organizes video data to facilitate efficient querying. A 
video is first broken down into short sequences and 
several kinds of entities (objects, activities, and events) 
are associated with them. The system defines a 9-tuple 
database [3] which supports different types of queries. 
Possible transitions are from the atemporal domain 
(queries) to the temporal domain (video sequences) — 
Figure 3. 

Transition 



Atemporal (queries) Temporal (video) 

Figure 3. Transition diagram for AVIS. 

• The Query by Image Content (QBIC) [4] is an- 
other content-based image retrieval system (CBIR) [6] 
which takes a different approach. During an initial 
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phase, called "population," images and videos are pro- 
cessed to extract features describing their contents. 
These could be colors, textures, shapes, camera and 
object motions. Queries are then composed graphi- 
cally and their features are extracted as well. A match- 
ing engine finds the videos or images with similar fea- 
tures from the database. This is illustrated in Figure 4. 

Temporal (video j 




Atempora) (image) 

Figure 4. Transition diagram for QBIC. 

• Vosaic [2] is a World-Wide Web browsing system ex- 
tended to support continuous video media. Although 
not a CBIR system, it introduces the notion of "video 
hyper-links" which allow a user to click on an image 
in a video stream and pull up another video stream. 
The corresponding transition diagram includes transi- 
tions between any combination of video and atemporal 
media — Figure 5. 



Transitions 




Atemporal (Images) Atemporal (images) 



Figure 5. Transition diagram for Vosaic. 

• For our framework, all objects in a document are sub- 
classed from the multimedia object abstract class — 
Figure 1 — and transitions are regarded in a consistent 
way, independent of the media type. Therefore, transi- 
tions between any combination of temporal and atem- 
poral objects are possible. Additionally, for temporal 
objects, we extend the notion of links and parameter- 
ize transitions by both contents and time. This is illus- 
trated in Figure 6, where the dashed arrows represent 
transitions that depend on both contents and time. 

The following two sections set up the framework. In Sec- 
tion 3, we introduce the terminology, state the assumptions 
and define several concepts. Then in Section 4 we describe 
the operations for hyper-media document manipulation. 



Transitions 




Temporal ** — " Temporal 



Figure 6. Transition diagram with dynamic 
links. 



3. Definitions and assumptions 

We define the class of multimedia objects according to a 
composite model. An entity within our hyper-media docu- 
ment model can be either a single multimedia object (Fig- 
ure 1) or a collection of multimedia objects. 

The basic unit of a temporal object is a frame. A set of 
frames creates a strand, a set of strands creates a ring and 
a set of rings tied together by synchronization information 
creates a rope. In the rest of this paper, we use the terminol- 
ogy given in Table 1. 

Dynamic links include a temporal dimension. They pa- 
rameterize the relationships between objects by contents 
and time. Assume % = {r\ , . . . , r n } the set of temporal 
multimedia objects, where % C O (these sets are figured 
as dashed boxes in Figure 1) and T = {r; r = [£,,£;]} 
the set of time intervals corresponding to these objects. The 
function d* :HxT O models the links between objects 
in 1Z and objects in O. 

For example, if r e TZ is a multimedia object with r 6 
7\ T = [0,200) seconds: i(r,n) = o u n = [0,25), 
di{r>T 2 ) = o 4 , r 2 = [80,100) and 4(r,r 3 ) = oio, r 3 = 
[100, 200). This defines a dynamic link from r to oi for the 
first 25 seconds; no link for the next 55 seconds; from r to 
04 for the next 20 seconds; and from r to 010 for the last 100 
seconds. 

Static links are a specialization of dynamic links, where 
the space is atemporal and the transition depends just on 
contents. Formally, they are modeled by 5/ : O -> O. 

We represent dynamic and static links with association 
maps [1]. An association map A specifies which objects 
correspond to which strands when they take part in links. To 
formally define the association map, we make the following 
assumptions: 

1 . The set of frames (raw data) is the set {1, 2, . . . , n] for 
some arbitrarily chosen, but fixed, integer n. 

2. A strand represents the set of all frames between i (in- 
clusive) and j (exclusive), where l<i<i<nare 
frame indices. The strand is then denoted by the pair 
[i, t is the start of the strand and j is the end of the 
strand. 
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Medium 


Any form of information that is manip- 
ulated by multimedia systems and can 
be broken down into frames (e.g., audio, 
video, control information for servo sys- 
tems and mechanical actuators, etc.). 


Frame 


Basic unit of continuous medium. Oper- 
auons on irames ore reaa oiiu write. 


Strand 


Immutable sequence of ordered 
frames — the immutability of strands 
means that the frames inside a strand 
can not be referenced as stand-alone 

PTititip<i "Frfvm (Tip oiit^irfp OnprjitinnQ nn 

strands are playback (which include fast 
forward and rewind, etc.), record, set 
and reset the reference. 


Ring 


A collection of n > 1 strands. The col- 
lection is ordered and the order of the 

pfimnocino ^trnnHc i« fiYi*H— -osm tint Kf* 

modified once the ring is created. An- 
other attribute is meta-data. 


Rope 


A collection of n > 2 rings (of the 
same or different medium^ tied together 
by synchronization information. Ring 
synchronization could be either forced 
(by means of a clocking device) or au- 
tomatic, and is handled at lower levels. 


Meta-data 


Additional information about a ring 
(e.g., name, description, keywords, etc.). 
Some fields are read-only and can not be 
modified (e.g., frame rate, etc.). 


T inlr 


A discrete function that formalizes the 
relationships between objects. In this 
context, by discrete we mean that the 
function values are references to individ- 
ual objects. Depending on the type of ar- 
gument, a link is either dynamic or static. 


Composite 
object 


A hierarchy of objects tied together by 
links — a multimedia document is a com- 
posite object as well. 



Table 1. Terminology. 



3. We define a partial ordering C on the set of all strands 
such that [<i,ii) C [i 2 ,i 2 ) if *i < ji < * 2 < i 2 . If 
ji 7* t 2 and [ii, ji) C [i 2 ,i 2 )> then [%uj\) C [t 2 , j 2 ). 

4. The set of multimedia objects is the set O = 
{oi»02» • • • yO m } for some arbitrarily chosen, but 
fixed, integer to. 

Under the assumptions 1-4, we define the following: 



Definition 1 

A set of strands S is well ordered if: 

1. S = ji), [is* ia)« - ■ ■ , for some integers 
(S is finite); 

2. [ii.ii) .E [12, ja)E...C[* i1 j.).. 
Definition 2 

A set of strands 5 = {[hj'i), [i 2 ,i 2 ), . . . , [i 8 J a )} is solid 
if: 

1. S is well ordered; 

2. There is no strand in the form [i 1 , i 2 ) and [i 2 , 13 ) . 

Definition 3 (Association map) 
An association map is a function X :0 S x1Z such that 
for each multimedia object Oi € O, the strands in A(o») are 
a solid set 

For example, 

= {(r*i[«s,is)),(ri, [»7, j?)), (rm,[io,i9))} 

means that oi occurs in all strands [43,^*3), [17, jV) and 
[Hj39) — the first member of a pair (r*, [i X) j y )) is the ring 
that contains [i zi j v ). 

The advantage of using association maps to model links 
is twofold: 

• Association maps correspond to fine segments on the 
x-axis of the Cartesian plane. From a database per- 
spective, they can be efficiently stored by any method 
for storing collinear line segments [ 1 ] . 

• For any given object, the association map determines 
all the referencing strands. This allows discarding of 
non-referenced objects with reference-based garbage 
collection mechanisms. 

A document containing just static links is organized in 
a tree-like hierarchy. Each node has a coordinate on the 
contents axis. A static link between two objects establishes 
a connection between their corresponding coordinates — see 
Figure 7a. However, if dynamic links are also present, this 
model can not represent the additional dimension, time. We 
define planar nodes, which have two. dimensions and there- 
fore correspond to line segments in the contents-time plane. 
For any given node, the contents coordinate is constant, like 
in the previous case. A dynamic link between objects estab- 
lishes a connection between a time segment and a point on 
the contents axis — see Figure 7b. 

Figure 8 illustrates a tree with nodes corresponding to 
static and dynamic links: For each node, we show the pro- 
jection of the coordinates on the time axis. This yields a 
single point for static links and line segments for dynamic 
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Figure 7. Tree nodes for static links (a) and for 
dynamic links (b). 



continuous media (digital video and audio) by providing fa- 
cilities for creating, editing and retrieving multimedia ob- 
jects [1 1] use an identical structure and therefore no inter- 
mediate conversion layer is required. 

4. Operations 

We classify hyper-media document management opera- 
tions into two categories: low- level operations which are 
available at the application level, and high-level operations 
available to application users. 



Root level 




I [ Text1 1 [ Ringl j [imapel j [ Rope 1 ] j 









• 
i 


< 


Static link 


MS' 


Dynarc 


cDnk Dynamic fink 

n — mi 





Figure 8. Tree with 1-D (contents) and 2- 
D/planar (contents, time) nodes. 



links. The time segments which are associated with other 
objects are shaded, while the ones with no associated ob- 
jects are left white. 

Therefore, a hierarchical multimedia document with dy- 
namic and static links can be represented as a tree with pla- 
nar nodes (i.e., planar document structure). In the case of 
dynamic links, nodes store the information corresponding 
to the additional dimension (time) and represent points in 
a 2 -dimensional space. The structure with tridimensional 
nodes is compatible only with static links and can not ex- 
press the temporal information required by dynamic links. 

The links (dynamic or static) model relationships be- 
tween objects in a top-down manner. For a given object, 
they determine the complete hierarchy of associated ob- 
jects, allowing for resource allocation and reservation, pre- 
fetching and caching. 

Structuring the media as frames, strands, rings and ropes 
(see Table 1 j not only offers a high-level abstraction for the 
raw data, but also allows for good performance from a low- 
level storage perspective. File systems designed to support 



4.1. Low-level operations 

The low-level operations include basic operations for 
manipulating strands, such as Record, Playback, Stop, 
Fast Forward and Fast Rewind. They also include op- 
erations to establish or remove links associated with a cer- 
tain strand. The definitions of these operations are listed in 
Table 2. 



Record 


Begins recording n > 1 multimedia 
strands. Recording continues until a sub- 
sequent Stop operation is issued. 


Play 


Retrieves a previously recorded set of 
n > 1 strands and processes each of them 
according to the medium type. 


Fast 
Forward 


A variant of Play that retrieves and prcn 
cesses every other k frame. 


Fast 
Rewind 


A variant of Play that retrieves every 
other h frame, starting with the last frame 
of each strand and progressing towards 
the first frame. 


Stop 


Issued to interrupt a previous command 
(Record or Play variant). 


Set 

Reference 


Establishes a link from a strand to a given 
multimedia object. 


Reset 
Reference 


Removes the existing link to a multime- 
dia object. 



Table 2. Low-level operations. 



4.2. High-level operations 

The entities available at the application level are tem- 
poral (e.g., strands, rings and ropes) and atemporal (e.g., 
text, graphics, static images, etc.) multimedia objects. Ap- 
plications allow operations such as creation, composition 
and presentation. A document (a set of objects and a set of 
links — relationships — between them) can also be regarded 
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Link 



Unlink 



Establishes a link between two objects. 
Static links require just the objects. Dy- 
namic links require the time coordinate as 
well. 



Removes a previously established link. 



Table 3. High-level composition operations. 

as a relational database [3]. Therefore, support for content- 
based access operations is intrinsic. The entity types corre- 
spond to each object's media data and the meta-data associ- 
ated with it. 

High-level operations are divided into two groups: gen- 
eral operations which allow overall manipulation of mul- 
timedia objects (such as creation and presentation), and 
content-based access operations which allow different 
types of queries. 

General Operations. 

Creation defines the construction of high-level multime- 
dia objects. In the case of temporal objects, the basic 
building blocks are strands. First they are recorded 
with Record and then assembled together to form 
high-level objects with Concatenate. Atemporal ob- 
jects are imported from other applications — their cre- 
ation is beyond the scope of this framework. 

Composition combines simple objects (temporal or atem- 
poral) into composite objects — Table 1. The user con- 
structs the hierarchy by specifying links between ob- 
jects. The corresponding operations (Link and Unlink) 
are denned in Table 3. 

Presentation defines the operations that are available for 
exploring a composite multimedia object, such as 
Playback, Jump, Return, Plunge, meta-data ma- 
nipulation and Query. Once the hierarchy has been 
constructed through composition, the user can interact 
with it in various ways. These operations are described 
in Table 4. 

For example, assume that a video ring r\ consists of 
three strands — see Figure 9. The first strand AC is related 
to an HTML document doc . html, and the third strand 
DI to another ring r 2 (audio). The user adds a link to 
doc . html in AC and a link to r% in DI. Therefore, the 
dynamic link of t\ is denned as follows: 



>[dh) = 



doc.html 

T2 



and the association map: 



Playback 


Sequentially retrieves and presents the 
strands corresponding to a high-level ob- 
ject Whenever a strand is associated with 
another object, feedback is given to the 
user. This operation includes all variants 
available at object level (fast forward and 
fast rewind). 


Jump 


Changes the context (following a link) 
from the object being played back to the 
one associated with it, advancing one 
level down in the hierarchy. Whenever 
the lower level object ends, the playback 
of the referencing object is resumed. 


Return 


Changes the context from the object be- 
ing played back to the one that referenced 
it, advancing one level up in the hierar- 
chy. This is not possible for the root ob- 
ject, which is not referenced by any other 
object 


Plunge 


Variant of playback which does not return 
to the referencing object. 


View data 


Displays the meta-data for a high-level 
object 


Change 
data 


Allows the user to change the meta-data 
fields which are not read-only. 


Query 


Provides services for content-based ac- 
cess. 



A(doc.html) = {(r u [AC\)} 
A(r 2 ) = {(ru[DI))} 



Table 4. High-level presentation operations 



When ri is played back, the application provides feedback 
about the links within AC and DL Assume Jump is se- 
lected at t = B, A < B < C. The first link is taken and 
the presentation changes from ri to doc . html. Whenever 
Return is selected (because this object does not have a tem- 
poral dimension, its presentation needs to be terminated by 
the user), the playback of n resumes at B. Next, assume 
Jump is selected at t = E, D < E < L The second link 
is taken and the audio corresponding to r 2 is played back. 
If Return is selected at t - G, F <G < H (before com- 
pletion), then the presentation of r 2 ends and ri resumes at 
E and finishes at J. 

Content-based Operations. Queries can be classified ac- 
cording to different criteria. We include examples for sev- 
eral query types. 

Type of Information involved In a query. Depending on 
their type, some queries require an additional process- 
ing step for all objects in a hierarchy, similar to the 
indexing stages described in [6]. 

Queries on object information take into account just 
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rt (video) r2 (audio) 




HTML Document Strand 2 



Figure 9. Example of hierarchy with 3 objects 
(two rings and a document). 



the media data. Unlike other systems [6], they do not 
require additional meta-data information and therefore 
work on any document structure. 

Query 1 

Return all objects that contain a 
given strand 

Queries on meta-data require objects to contain valid 
information (no NULL values 2 ) in the meta-data fields 
that are selected. 

Query 2 

Return all objects created after 
March 21, 1997 

Queries on mixed information take into account all the 
information about objects. Again, they require meta- 
data to be present. 

Query 3 

Return all objects that contain a 
given strand and were 
created after March 21, 1997 

Granularity of results. Object-level queries return entire 
objects (e.g., text document, rings, ropes, etc.). Sub- 
object-level queries return parts of objects (e.g., a sec- 
tion of a text document, strands, etc.). 

5. Partial verification and results 

In this section we describe a partial implementation of 
the framework. First we argue mat, although partial, the 
verification does not restrict the generality and demonstrates 
that the framework is feasible. Then we cover several 
implementation-specific details. We conclude with an ex- 
periment and results. 

2 If an object does not define a particular meta-data field, then the value 
is considered NULL. 



5.1. Multimedia objects 

As stated in Section 2, our framework regards the multi- 
media objects in a consistent way, independent of the media 
type. However, one of the novel aspects introduced in the 
previous sections is dynamic links, which are defined for 
temporal objects — Section 3. Existing systems already im- 
plement static links (e.g., World-Wide Web) and therefore 
their verification is not required. Consequently, in order to 
implement and test dynamic links, we use just temporal ob- 
jects. 

We restrict the verification to video information. Besides 
supporting dynamic links, this choice also allows testing 
video to video transitions, which just a few other systems 
support [2]. Restricting to one media type does not decrease 
the generality. Rather, it simplifies the implementation. The 
details associated with manipulating other types of media 
(e.g., audio, text, etc.) are left aside. 

5.2* Design patterns 

An essential framework attribute is a high degree of flex- 
ibility, such that the framework is usable for a wide range of 
applications. Design patterns help to achieve this objective, 
emphasizing the reuse of successful designs and architec- 
tures [5]. 

Our document architecture is an instance of the Compos- 
ite pattern: each element of the hierarchy could be either 
a simple object or a composite object. Composite ensures 
an uniform document interface and enables for any combi- 
nation of media types. It also allows to apply the Visitor 
pattern, which decouples the document's structure from the 
high-level operations — Section 4.2. Consequently, defining 
new operations does not require changes on the document 
side. This flexibility is important in the context of large doc- 
ument databases, where recreating the entire database is an 
expensive operation. 

53. Implementation 

For our prototype system, we have chosen an object- 
oriented architecture and design. A high-level toolkit like 
Motif [7] is the best candidate for the user interface, which 
focuses on ease of programming and event-driven process- 
ing. Likewise, media data processing requires fast and reli- 
able code. C++ and the Standard Template Library [10] are 
an excellent choice for these requirements. 

The strands are motion-JPEG compressed video se- 
quences. The absence of temporal prediction allows for 
frame-level granularity. 3 For our implementation, we use 

3 Tne framework does not depend on strand granularity. In case tem- 
poral prediction is used (e.g., MPEG), the granularity is higher (GOP for 
MPEG). Because the strands are immutable, this does not affect higher- 
level objects. 
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this format because of the availability of hardware-assisted 
operations, 4 which allow for fast processing. 

Processing of the media data associated with digital 
video has to take place in a timely manner, according to 
the soft deadlines typical to multimedia systems [13]. The 
scheduling solution used in our prototype relies on the X 
timer callbacks [7, 14]. Although not optimal, this approach 
has the advantage of being portable and has been adopted 
by other multimedia applications [8]. However, our imple- 
mentation is generic and could be easily modified to take 
into account real-time services, in case they are provided by 
the system. 

5.4. Results 

To verify the framework, we performed a series of exper- 
iments with the system described in 5. 1 and 5.3. The exper- 
iments involve three 5 minute video rings, ri , r% and r3. r% 
consists of 4 strands: n = {si , «2, «3, s 4 } and its dynamic 
link is: d,(n, [120, 180]) = r 2 , d,(r 1( [240,300]) = r 3 
(time intervals are in seconds). The image size corre- 
sponding to each video is 320 by 240 pixels, at a rate of 
20 frames/second. 

We were interested in observing how intuitive the dy- 
namic link between different video objects is for the user 
and how fast our system can descend in the hierarchy 
(change the context from one video ring to another). The 
program executed on an HP workstation and the video ob- 
jects were served by an HP multimedia server, via NFS. We 
ran about 50 experiments and used as metrics the delay for 
Jump and Return operations. The average measured val- 
ues are 2.5ms and 1.2ms respectively. 

The results show that the context changes are fast (mil- 
lisecond range), hence the transitions from one ring to an- 
other are smooth and satisfy the user's visual perception. 
The difference between the two measured times are caused 
by the different low-level operations associated with each 
action (i.e., file open, file close) and are implementation- 
dependent. 

6. Summary 

We presented a framework for advanced link manage- 
ment in hyper-media documents. Trie framework is based 
on an object-oriented document architecture, supports non- 
linear transitions between multimedia objects and has in- 
trinsic support for content-based access. Non-linear transi- 
tions are formally represented by dynamic links and static 
links. To represent the additional temporal information as- 
sociated with the dynamic links, hyper-media documents 
are structured as trees with planar nodes. A prototype sys- 
tem shows that: (1) the framework is viable for hyper-media 

4 We use hardware from Parallax Graphics, Inc. 



documents which contain temporal objects; (2) the dynamic 
link abstraction is feasible; and (3) the overall system per- 
formance satisfies the user's visual perception. 
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